Non-Markovian fluctuations in Markovian models of protein dynamics 
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Recent experiments using fluorescence spectroscopy have been able to probe the dynamics of con- 
formational fluctuations in proteins. The fluctuations are Gaussian but do not decay exponentially, 
and are therefore, non-Markovian. We present a theory where non-Markovian fluctuation dynamics 
emerges naturally from the superposition of the Markovian fluctuations of the normal modes of the 
protein. A Rouse-like dynamics of the normal modes provides very good agreement to the exper- 
imentally measured correlation functions. We provide simple scaling arguments rationalising our 
results. 



Proteins molecules are the buiding blocks of life. The 
three-dimensional physical structure of a protein is inti- 
mately related to its biological function. Proteins func- 
tion in an environment where noise is ubiquitous. The 
three-dimensional conformation of a protein, therefore, is 
not static but itself undergoes fluctuations. Since protein 
function is so strongly determined by protein structure, 
the fluctuating nature of a protein molecule has impor- 
tant implications for its biological function [3, . A pre- 
cise determination of the static and dynamic properties 
of conformational fluctuations in proteins is, therefore, of 
great importance. 

In recent experiment [3[ , conformational fluctuations of 
the protein flavin reductase have been observed and char- 
acterised using single-molecule fluorescence spectroscopy. 
The fluorescence lifetime is directly correlated to the 
distance between the flavin (fluorophore) and tyrosine 
(quencher) groups within the protein. The distance fluc- 
tuations between the flavin and tyrosine groups, then, 
gives an indirect measure of the fluctuations of the entire 
protein. Remarkably, the experiments find that the fluc- 
tuations remain correlated over a five decades in time, 
spanning the range of 10~ 4 s - Is. This is indicative of 
the presence of multiple relaxation mechanisms operating 
at different time scales. Further, there is convincing evi- 
dence that the fluctuations are Gaussian, and when taken 
together with the absence of single-exponential decay of 
the correlations, imply that they are also non-Markovian. 

In this Letter, we show how a Markovian dynam- 
ics for the protein normal modes generically produces 
non-Markovian fluctuations in the distance between two 
residues on the protein backbone. Imposing the simplest 
Rouse-like dynamics for the normal modes we obtain all 
correlation functions for the distance fluctuations and 
find them to be in very good agreement with experiments 
@: 0, HI • Simple scaling arguments are provided to ratio- 
nalise our analytical calculations. 

Before presenting our detailed calculation, we illustrate 
the basic mechanism by which non-Markovian behaviour 
arises in this problem. Consider the Ornstein-Uhlenbeck 
process (OUP) which describes the velocity v(t) of Brow- 
nian motion. This is a stationary, Gaussian, Markovian 



process with a correlation function po( r ) = ( v {t) v (t + 
t)) = fcsTexp(— Tr), where is the relaxation time. 
Take two such uncorrelated processes vi(t) and V2(t), 
each with distinct relaxation times Ti and T2, and ask 
for properties of the stochastic process described by their 
sum u(t) — vi(t) + V2{t). Since each Vi is Gaussian and 
stationary, so is their sum u. The correlation function of 
the sum is p(r) = (u(t)u(t + r)) = ((vi (t) + v 2 (t))(v 1 (t + 
t) + V2(t + t))) , and since the processes are uncorrelated, 
is the sum of the correlation functions of the individual 
processes, p(r) = kBT\exp(— Tit) +cxp(— r 2 r)]. Then, 
from Doob's theorem [6|], which says that a Gaussian, 
stationary process is Markovian if and only if its corre- 
lation function is a single exponential, we conclude that 
u(t) is Gaussian, stationary, but non-Markovian. Gener- 
alising, an abitrary superposition of iV distinct but un- 
correlated OUPs, u(t) = J2iLi a i v i(t)i is a l so Gaussian, 
stationary, and non-Markovian. Thus, non-Markovian 
behaviour can arise very generally from a superposition of 
Markovian processes. It precisely this mechanism which, 
as we show below, generates the non-Markovian fluctua- 
tions seen in protein dynamics. 

A minimal model of a protein replaces the complicated 
stereochemistry of the amino acids and its associated 
secondary and tertiary structures by a simple connected 
chain of beads and springs [3] • The relaxations in such 
a model derive from interactions between the beads and 
the combined effect of fluctuations and dissipation due to 
the solvent. In the energetic ground state, the conforma- 
tion is labelled by the positions R° of the beads, where 
the subscript n is a position label along the chain. In- 
dependent of the specific nature of the interactions, con- 
formational fluctuations about the ground state can be 
described by a parametrisation R n = R° +u„. Here, u„ 
is the deviation of the n— th bead from its ground state 
conformation. Then, the instantaneous distance d m „(i) 
between two monomers located at m and n is given by 
d mn (t) = R m (t) - R„(t) = d° mn + u m (t) - u„(i), where 
d° nn = R° ra — R° is the equilibrium distance. For a chain 
whose ends are not tethered and therefore free of external 
forces, the displacements must satisfy du n /dn — at n — 
and n ~ N . This motivates the introduction of normal 
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modes of the form u„(t) = 2 YlpLi Qp(*) cos(p7rn/iV), in 
terms of which the distance is 

oo 

d TO „(t) = d5 ) nn + 2^Q p (t)[cos(p^m/iV)-cos(p7rn/iV)]. 

p=i 

(1) 

This key equation shows that the distance fluctuations 
are linearly related to the fluctuations of the normal 
modes. If the normal mode fluctuations are Gaussian 
and Markovian, the distance fluctuations, by our previ- 
ous argument, are generically non-Mar kovian. 

The simplest model of polymer dynamics which yields 
a Gaussian and Markovian fluctuation for the normal 
modes is the Rouse model [8( . Here, we impose Rouse- 
like dynamics on the harmonic deviations u„, 



c 



du n (t) 3k B Td 2 u n (t) 



at 



b 2 dr 



(2) 



so that the overdamped dynamics of the fluctuations is 
a balance between the frictional force proportional to £ 
times the velocity of the n— th monomer, an entropic 
restoring force proportional to 3fcsT/6 2 which arises due 
to the connectivity of the chain, and the injection of ther- 
mal fluctuations from the solvent. The constant term in 
the Rouse mode expansion describing the motion of the 
center of mass of the chain and has been ignored here 
since it does not enter the expression for the distance. 
The dynamics of the Rouse modes follows immediately 



dQ P (t) 
dt 



-k p Q p (t) + F p (t), 



(3) 



where ( p = 2N(, k p = 6p 2 TT 2 k B T/Nb 2 and f n (t) = 
2 J^Li F p (i) cos(pTrn/N). The fluctuations of the Rouse 
modes are, therefore, identical to the fluctuations of the 
velocity of a Brownian particle, both being governed by 
the OUP. The correlations between the modes is given 
by 1 



(Q P (t)-Q,(* + r)> 



Nb 2 



(p 2 + q 2 )~K- 



■exp(~_p 2 r/ri), 



(4) 

showing that each Rouse mode has a distinct relaxation 
time and is unocorrelated with every other Rouse mode. 
Here, T\ = Nb 2 C, p /6TT 2 kBT is the relaxation time of the 
first Rouse mode. 

Combining the results of Eq. 1, and Eq. 4, we see that 
d mrl (i) is a stochastic process which is an infinite super- 
position of OUPs. Explicitly, the correlation function 
p mn (j) = (d mn (t) • d mn (t + r)) of this process is 

oo ..,2 

Pmn(r) = 2 V -^[cos(p7rn/N)-co S (pnm/N)] 2 e- p2T / T \ 

p=l 

(5) 

By Doob's theorem, it is immediately clear that the dy- 
namics of d mn (t) is non-Markovian. Since d mn (t) is a 



Gaussian process, all higher order time correlation func- 
tions can be expressed in terms of the p m „(r) using 
Wick's theorem. We call the stochastic process defined 
by Eq. 1 and Eq. 3 the superposed Ornstein-Uhlenbeck 
process. The two-point correlation p m „(T) completely 
specifies the process. 

Our work thus far is, in a formal sense, a Marko- 
vian embedding (in terms of the normal modes) of a 
non-Markovian process (the distance fluctuations). Such 
Markovian embeddings are also used in describing the 
underdamped dynamics of a Brownian particle in a po- 
tential. The stochastic process describing the position 
alone is non-Markovian, but the joint process in the en- 
larged set of position and velocity variables is Markovian 
Q. In the present case, the Markovian embedding is also 
Gaussian, and it is this simplification that allows us to 
calculate all correlation functions in terms of the two- 
point correlation p m „(r). 

We now turn to comparingour analytical results with 
data from the experiments [H, 0, Sj- The experiments 
measure the fluorescence liftctimc 7 _1 (£) which is related 
to the distance d mn {t) = y d mn (t) • d mn (t) between the 
fluorophorc and the quencher as 



7(<) = k Q e 



-\d„ 



(6) 



where ko, A are parameters determined by the protein, 
d m „(t) = d^ n + u m {t) - u n (t) and e^„ is the mean 
value of the distance. Then correlation functions of the 
lifetimes c>7 -1 (i) = 7~ 1 (<) — (7 -1 ) ar e related to correla- 
tions in the distance fluctuations. This relation is simple 
for the two-point correlation function [J], 

(7) 

but becomes complicated for three- and higher-point cor- 
relations. Explicit forms for three and four point corre- 
lations are given in 

We compare the results for two- and four-point fluores- 
cence lifetime correlations using the correlation functions 
of the superposed OUP. The experimentally known val- 
ues of the parameters are d^ nn = 4.5A, [3 = 1.4A -1 , and 
j/ksT = 0.48A~ 2 s Q. With fitting parameters which 
are very close to these estimates, the agreement between 
the theoretical prediction and the experimental data for 
the two-point correlation is good over the entire 5 decades 
in time, as shown in Fig. 1. In Fig. 2 we compare theory 
and experiment for the four-point function by fitting the 
same parameters used in Fig. I. Again, the agreement 
is good over the full 5 decades in time. 

The interesting symmetry in time of the three-point 
correlation function (S^^ 1 (O)^ -1 (ti)S'j^ 1 (t\ + t2)) — 
(^7 _1 (0)(57 _1 (t2)(57 _1 (ti + t 2 )) observed in experiment 
[J] holds by construction for the superposed OUP. This 
is so because, by stationarity, all times indices in the first 
term of the identity can be shifted by ti+t2, which repro- 



3 




4 

„" 35 

& 3 
v 

~A 

5. 2.5 

to 

S 2 

S 1.5 
S. 1 
" 0.5 









o\° 

<«?\ 













































FIG. 1: The normalised two-point (above) and four-point 
(below) autocorrelation functions of the fluorescence lifte- 
times plotted against time t in seconds. The solid line is 
the theoretical curve obtained by using the the normalized 
correlation function in Eq. [5] while the data is from Ref. 
The theoretical curve has been multiplied by the first data 
point. The parameters used are f3 = 1.3A - 1 , d^n — 2.8A, 
7/fc s T = 0AA~ 2 s, m = 6, n = 30 and TV = 30. 



duces the second term of the identity. Similar relations 
also hold for the higher-point correlation functions. 

The Rouse-like dynamics of the normal modes, then, 
can capture all the essential features of the experimen- 
tal fluctuation spectrum both qualitatively and quantita- 
tively. This provides evidence that the large-scale long- 
time behaviour of proteins are identical to those of struc- 
tureless Rousian polymer chains, not only for statics 0], 
but also for aspects of dynamics. 

To fully specify the superposed OUP we must obtain 
all the joint probabilities P(d, t; d',t'; d", t"; . . .) on the 
sample paths d mn (t) = d [6] . As the process is Gaussian, 
all such probabilities are multivariate Gaussian distribu- 
tions determined by the single function p m „(r)_9]. The 
one-point distribution P(d, t) is independent of time by 
stationarity and is a Gaussian in d with mean d t ^ nn and 



variance /? mn (0) 
P(d,t;d',0 = 

where t' = t- 



Nb 2 . The two-point distribution is 



1 



;27r) 3 det[C(T) 



■exp[ 



(8) 

-t, A = {d x ,d y ,d z ,d' x ,d' y ,d' z ), and CV, (t) = 
(Ai^A^t + r)) = f A . A . A i A i P(d,t;d',f) is the 6 x 6 
matrix of correlations. Since the only non-zero correla- 
tions are of the type (d x (t)d x (t)} or (d x (t)d x (t + t), this 
matrix is band-diagonal, with ^p mn (0) on the main di- 
agonal and ■^p m n{ T ) on the upper and lower diagonals 
two removed from the main diagonal. The higher joint 
probabilities have similar forms, but with enlarged vector 
A and enlarged matrix C jsj . From the Gaussian nature 
of the process, and by Wick's theorem, the four-point 
correlation function pm n (T, t',t") = (d mn (t) ■ d mn (t + 
T )d mn {t + T ') • d mn (t + t")) may be calculated as sums 
of products of the two point correlation function p mn (t) . 
It is important to emphasis that Xie et al have explicitly 
checked that the higher order correlations obey Wick's 
theorem and haver thereby confirmed that the distance 
fluctuations are Gaussian. 

We have been able to characterise the stochastic pro- 
cess governing the distance fluctuations in terms of its 
correlation function and the joint probabilities. It is of 
interest to ask if there is an effective description, in terms 
of a suitable Langevin equation, for the distance fluctu- 
ation d mn (t) itself. In other words, is it possible to con- 
struct a Langevin equation for d mn (t), given its correla- 
tion function p mn (t) ? Fox Q has provided a solution to 
this inverse problem in terms of a linear Langevin equa- 
tion with a memory kernel and Gaussian, coloured noise. 
The effective Langevin equation for the superposed OUP, 
then, is 



at 



(*) = 



- / D mn (t - t')d Tl 
Jo 



l {t') + f mn (t) (9) 



where the noise is Gaussian with mean (f mn (t)) = and 
variance {f mn i(t)f mrl j(t + T)) = 2fc B TL» mn (r). It can be 
shown that this Langevin equation yields the correla- 
tion function p mn (j) and the full heirarchy of joint prob- 
abilities described above, provided the Laplace transform 
of the diffusion kernel D mn (s) = dr exp(— sr)D mn (r) 
satisfies 



Dmn (^) 



(10) 



where p m „(s) is the Laplace transform of p m nij)- An ex- 
plicit expression for D mn (s) can be obtained by Laplac- 
ing transforming Eq|5l replacing the summation by an 
integration, and inserting the expression for p mn (s) in 
the equation above. For m = 0, n = N, we get for 
D(s) = D 0N (s), 



D(s) = 



n 1 



= [§ -tan- x ( : 



(11) 
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From this, we may obtain an effective friction kernel £(s) 
using the generalised Stokes-Einstein relation D(s) = 
/cbT/C(s). This effective friction kernel recieves contribu- 
tions from individual dissipative effects of all the Rouse 
modes and is the source of memory. For st\ <C 1, which 
corresponds to times much greater than the longest re- 
laxation time, £(s) — * T\, which implies that the mem- 
ory kernel is proportional to 5(t). Reassuringly, the 
Markovian limit is reproduced for times much longer 
than the longest time scale in the problem. On the 
other hand, for sti ^> 1, ((s) — > \\Jt\) 's, implying 
that is proportional to {ri/t) 1 / 2 . Thus, for times 
smaller than the longest relaxation time, the memory 
kernel shows a power law decay in time with an expo- 
nent of — A. Correspondingly, the normalized correlation 
function C(s) = p(s)/p(0) is 



C{s) = - 
s 



1 - 



1 



r 71 " 1/1 

.[ tan-^ — 



(12) 



where p(s) = Pon(s)- For sti ^S> 1, C(s) — * 1/s, which 
implies that at short times C(t) = 1. In the opposite 
limit of sti <C 1, C(s) « t\/(1 + st\), which implies 
that at long times C(t) is proportional to exp(— t/rx) 
and decays as a single exponential, again suggesting that 
the fluctuations become Markovian at long times. These 
asymptotes can be understood by a simple scaling argu- 
ment. The normalized correlation function is given by 
c (t) = J2 P ,odd e ~ p2t/Tl /P 2 - 11 is evident from the ex- 
pression for C(t) that for times t •> ti, when all but 
the longest modes have relaxed, C(t) decays as a single 
exponential. For times t 3> Ti/p 2 , all modes above the 
p-th mode have already relaxed and do not contribute to 
the summation. This implies that at any time t, modes 
p = l,2...p*(t) contribute to the summation, where 
p*(t) w (ri/t) 1 / 2 . The slow decay of this maximum mode 
number shows up as a non-Markovian effect and is ulti- 
mately responsible for the power-law dependence seen in 
the frictional memory kernel. 

Our work has precursors in the contribution of Kou 
and Xie [4j who proposed a one-dimensional phcnomcno- 
logical Langevin equation incorporating fractional Gaus- 
sian noise. However, no microscopic basis was provided 
for memory kernel or for the form of the noise. Start- 
ing from a microscopic description, we obtain a vector 
Langevin equation with memory, obtain an explicit ex- 
pression for the memory kernel, and show that this has a 
power law decay. The noise in our Langevin equation is 
not a phenomenological fractional Gaussian noise, but is 
fully specified from the microscopies. Debnath et al [T3| 
use a semi-flexible model of polymer dynamics to explain 
the experimental data. Our work shows that semifiexibil- 
ity is not necessary to understand those aspects of pro- 
tein dynamics probed by the fluorescence experiments. 
A similar approach is that of Tang and Marcus [ll| for 
a flexible polymer, but the experimental results can be 



obtained only by assuming a certain disorder along the 
chain and averaging over the disorder. Our work shows 
that the heterogeneity of the protein residues is not nec- 
essary to explain the experimental results. None of the 
above have elucidated the appearence of non-Markovian 
behaviour from a simple superposition of Markovian fluc- 
tuations as is done here. 

In summary, we have presented a mechanism where 
a superposition of the Markovian dynamics of the nor- 
mal modes of the protein conformation gives rise to 
non-Markovian fluctuations of the distance between two 
points in the protein. The model provides an accurate 
fit to experimental data for both two-time and four-time 
correlation functions, exhibits a symmetry of these cor- 
relations functions found experimentally, reproduces the 
power-law decay of the frictional memory kernel, and 
clarifies how non-Markovian behaviour arises in protein 
dynamics. Given the non-specific nature of our model, we 
believe that non-Markovian fluctuations should be seen 
universally in all biopolymers and not only in proteins. 
Our model can be extended to include more detailed de- 
scriptions of the protein normal modes and their relax- 
ation mechanism. Using our model, several quantities of 
interest in fluorescence microscopy like the survival and 
first passage times of the distance may be calculated. 
Work is underway to explore these possibilities. 

This work was first presented at the 2008 Biophysics 
Summer School, at the Harishchandra Research Insti- 
tute, Allahabad. RA wishes to thank the organisers and 
participants for useful comments and suggestions. 
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